NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Privacy-Preserving Deep Action Recognition: An Adversarial Learning Framework and A New Dataset

https://doi.org/10.1109/TPAMI.2020.3026709

Wu, Zhenyu; Wang, Haotao; Wang, Zhaowen; Jin, Hailin; Wang, Zhangyang (September 2020, IEEE Transactions on Pattern Analysis and Machine Intelligence)
null (Ed.)
Full Text Available
Adversarial Graph Embedding for Ensemble Clustering

https://doi.org/10.24963/ijcai.2019/494

Tao, Zhiqiang; Liu, Hongfu; Li, Jun; Wang, Zhaowen; Fu, Yun (August 2019, International Joint Conferences on Artificial Intelligence Organization)

Ensemble clustering generally integrates basic partitions into a consensus one through a graph partitioning method, which, however, has two limitations: 1) it neglects to reuse original features; 2) obtaining consensus partition with learnable graph representations is still under-explored. In this paper, we propose a novel Adversarial Graph Auto-Encoders (AGAE) model to incorporate ensemble clustering into a deep graph embedding process. Specifically, graph convolutional network is adopted as probabilistic encoder to jointly integrate the information from feature content and consensus graph, and a simple inner product layer is used as decoder to reconstruct graph with the encoded latent variables (i.e., embedding representations). Moreover, we develop an adversarial regularizer to guide the network training with an adaptive partition-dependent prior. Experiments on eight real-world datasets are presented to show the effectiveness of AGAE over several state-of-the-art deep embedding and ensemble clustering methods.
more » « less
Full Text Available
Visual to Sound: Generating Natural Sound for Videos in the Wild

Zhou, Yipin; Wang, Zhaowen; Fang, Chen; Bui, Trung; Berg, Tamara L. (June 2018, IEEE Conference on Computer Vision and Pattern Recognition)

As two of the five traditional human senses (sight, hearing, taste, smell, and touch), vision and sound are basic sources through which humans understand the world. Often correlated during natural events, these two modalities combine to jointly affect human perception. In this paper, we pose the task of generating sound given visual input. Such capabilities could help enable applications in virtual reality (generating sound for virtual scenes automatically) or provide additional accessibility to images or videos for people with visual impairments. As a first step in this direction, we apply learning-based methods to generate raw waveform samples given input video frames. We evaluate our models on a dataset of videos containing a variety of sounds (such as ambient sounds and sounds from people/animals). Our experiments show that the generated sounds are fairly realistic and have good temporal synchronization with the visual inputs.
more » « less
Full Text Available
Towards Privacy-Preserving Visual Recognition via Adversarial Training: A Pilot Study

https://doi.org/10.1007/978-3-030-01246-5

Wu, Zhenyu; Wang, Zhangyang; Wang, Zhaowen; Jin, Hailin (January 2018, European Conference on Computer Vision)

Full Text Available

Search for: All records